Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎮 SIMT Execution
GPU Programming, Warp Divergence, Thread Blocks, CUDA Model
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
146347
posts in
66.2
ms
Optimizing
Recommendation Systems with
JDK
’s Vector API
netflixtechblog.com
·
3h
·
Discuss:
Hacker News
⚡
SIMD Optimization
Real-Time 3D
Engines
trendhunter.com
·
18h
🎮
Game Engines
GenDRAM
:Hardware-Software Co-Design of General Platform in
DRAM
arxiv.org
·
23h
🌊
Memory Bandwidth
WarpSpeed
automatically rewrites Nvidia core library, achieves 3.6-100x
speedup
doubleai.com
·
12h
·
Discuss:
Hacker News
⚡
Hardware Acceleration
How I Built a Full AI Studio for 6GB
VRAM
Cards (In 9 Hours of
AI-Assisted
Chaos)
hackernoon.com
·
7h
🧩
mimalloc
CUDA
From First
Principles
Part 2
pub.towardsai.net
·
3d
🖥️
OpenCL
Phantasm0009/accel-gpu
: NumPy for the browser GPU — zero shaders, zero dependencies
github.com
·
8h
·
Discuss:
Hacker News
🎮
WebGPU
Poking
a 200-Line GPT Until It Breaks (So You
Understand
Bigger Models Better)
dev.to
·
4h
·
Discuss:
DEV
🏷️
Pointer Tagging
OpenAI Codex-Spark Achieves Ultra-Fast Coding
Speeds
on
Cerebras
Hardware
infoq.com
·
13h
⚡
Hardware Acceleration
ISSCC
2026: Rebellions details industry's first quad-chiplet AI solution with UCIe interconnects — claims
Rebel100
AI accelerator equals the power of Nvidia H200 with lower power envelope
tomshardware.com
·
12h
⚡
Hardware Acceleration
Porting
AI Music Generation to NVIDIA
Jetson
hackster.io
·
2d
⚡
Hardware Acceleration
Microsoft
Shader
Execution
Reordering
Brings 90% Performance Increase on Intel Arc B-Series, 80% on NVIDIA "Blackwell" GPUs
techpowerup.com
·
19h
📊
Extrae
Simulating
Queueing
buttondown.com
·
9h
📮
Multi-producer Queues
Data Driven Optimization of GPU efficiency for Distributed LLM
Adapter
Serving
arxiv.org
·
23h
🎮
WebGPU
MoRI
— AMD's MoE dispatch and
KV
Cache library
github.com
·
1d
🔄
Glommio vs Tokio
A GPU
Microarchitecture
Optimized for Fully
Homomorphic
Encryption
semiengineering.com
·
1d
🔢
Homomorphic Encryption
Agentic
Engineering: Building Without
Writing
dehora.net
·
7h
🔲
Cellular Automata
Lazuli
Nightly (2026/03/02) (Game
Cube
emulator only) is release!
ngemu.com
·
13h
🎮
QEMU TCG
Nvidia is working on a chip for AI
inferencing
with
Groq
technology
techzine.eu
·
20h
⚡
Hardware Acceleration
TurboSparse
Efficiency: Achieving 97% Parameter Sparsity in
Mixtral-47B
hackernoon.com
·
1h
🤖
TVM
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help